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Let X= (Xi,X2,Xz) be a spherically symmetric random vector of which only (X\,X2) can be 
observed. We focus attention on estimating F, the distribution function of the squared radius 
Z := X\ + X% + X%, from a random sample of (X\,X-i). Such a problem arises in astronomy 
where (X\,Xi,Xz) denotes the three dimensional position of a star in a galaxy but we can 
only observe the projected stellar positions (X\,X%). We consider isotonic estimators of F 
and derive their limit distributions. The results are nonstandard with a rate of convergence 
^/ra/logn. The isotonized estimators of F have exactly half the limiting variance when compared 
to naive estimators, which do not incorporate the shape constraint. We consider the problem of 
constructing point-wise confidence intervals for F, state sufficient conditions for the consistency 
of a bootstrap procedure, and show that the conditions are met by the conventional bootstrap 
method (generating samples from the empirical distribution function). 

Keywords: asymptotic normality; consistency of bootstrap; globular cluster; nonstandard 
problem; shape restricted estimation; spherically symmetric distribution 

1. Introduction 

Stereology is the study of three-dimensional properties of objects or matter usually ob- 
served two-dimensionally. We consider such a problem, which arises in astronomy. Sup- 
pose that the position X := (Xi,X2,Xs) of a star within a given galaxy has a spheri- 
cally symmetric distribution and that we observe the projected stellar positions, that is, 
(Xi,X?) (with a proper choice of co-ordinates); and consider the problem of estimat- 
ing the distribution function F of the squared distance Z := X\ + Xf + X\ of a star 
to the center of the galaxy from a random sample of (X\,X2). In this paper, we study 
the statistical properties of three estimators of F. We show that enforcing known shape 
restrictions (monotonicity) in the estimation procedure leads to estimators with lower 
asymptotic variance (exactly by one-half in this case). We also consider the problem of 
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constructing point-wise confidence intervals (CIs) around F, and show that the conven- 
tional bootstrap method can be used to construct valid CIs. Our treatment is similar 
in flavor to Groeneboom and Jongbloed's [6] study of the Wicksell's [13] "Corpuscle 
Problem." 

Suppose that X has a density of the form p(x\ + x\ + x 2 ). Then Y := X\ + X 2 ~ G 
and Z have densities 

g(y)=n [°° -&Ldz (1.1) 

J y v z -y 

and f(z) = 2n^/~zp(z). The reader may recognize (1.1) as Abel's transformation. It may 
be inverted as follows. Let 

V(y)=r^Ldu. 

Jy V u - y 



Then 

p{z) dz 



V(y)=n 



u \Jz-u 



n 2 / p{z)dz (1.2) 



so that p(z) = — V'(z)/n 2 at continuity points. Observe that V is a nonincreasing func- 
tion. The quantity of interest, F, can be related to V and, therefore, to the distribution 
of {X U X 2 ) by 



f x 2 f°° 

F(x) = 2ny/Hp(u) du = 1 + - y/zdV{z), 

Jo ™Jx 



(1.3) 



where the last equality follows from J" °° 2n^Jup{u) du= 1. Relationship (1.3) will be used 
extensively in the sequel. Let 



U(x) := / V(t) dt 
Jo 

for x > 0. Then U is concave since V is nonincreasing. Concavity can also be seen from 



U(x)=2 / { Vm — y/(u — x) + }g(u) du, 
Jo 

where y + = max{y,0}. Let J(t) := \Jz — tdV(z). Then 

G(t)=n / -^Ldydz 

Jo Jo V z ~y 



2n {4~z- y/(z - t) + }p{z)dz = 1 + -J(t), 



(1.4) 



where the last step follows from J Q 2n^/zp(z) dz = 1 and J(t) = —n 2 J t V 1 z — tp(z) dz 
(using (1.2)). 
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Figure 1. Left panel: Plots of V„ (green), V* (red, piece-wise constant) and V (blue, smooth); 
middle panel: F„ (red), F (blue, smooth); right panel: F* (green, piece- wise constant), F n 
(black, piece-wise constant) and F (blue, smooth) from a sample with n = 20 data points. 



Now suppose that we observe an i.i.d. sample {(-2Qii-2Q2)}it=i having the same distribu- 
tion as {X\,Xz). Letting Yi = Xf x + Xf 2 , a natural (unbiased) "naive" estimator of V is 

# r° dG*(u) _ l^m>y} 



V*{y) 



V u ~y n ^-[ V y i-y 



where Gjf is the empirical distribution function (EDF) of the Yi's. Then Vf(y) is an 
unbiased estimator of V(y) for each fixed y; but has infinite discontinuities at the data 
points Yi and is, therefore, not monotonic when viewed as a function of y. See Figure 1. 
We call the naive estimator. The naive estimator can be improved by requiring 
monotonicity. If Vf were square integrable, this could be accomplished by minimizing 
the integral of (W - V*f over all nonincreasing functions W, or equivalently, 

W 2 (y)dy-2 W(y)V*(y)dy. (1.5) 



The function is not square integrable, but it is integrable, so (1.5) is well defined. 
Let Vf be the nonincreasing function W that minimizes (1.5). Existence and uniqueness 
can be shown along the lines of Theorem 1.2.1 of Robertson et al. [9], replacing the sums 
by integrals. Groeneboom and Jongbloed [6] derived the limit distributions of and 
V# : Let xq > and 

e„ := \/n- 1 \ogn 1 (1.6) 

then under appropriate conditions, 

V*(x )-V(x ) 



N(0, 9 (x )), (1.7) 



y^o)-v ixo) ^ N ri_ gM 
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where =>■ denotes weak convergence. 

We can define two estimators of F, F n and Fjf , by replacing V from the right-hand 
side of (1.3) with V# and V#, respectively. Observe that Ff is not even nondecreasing; 
F% is nondecreasing, and max{i^*,0} (as F# < 1), is a valid distribution function and 
a more appealing estimator of F (see Figure 1). 

Yet another estimator of F can be obtained by isotonizing Ff over all nondecreasing 
functions. Let F n be the nondecreasing function that is closest to F&, in the sense 
that it minimizes (1.5) with replaced by Fff . It is not difficult to see that then 
max{0, min(_F n , 1)} is a valid distribution function. Figure 1 shows the graphs of the 
estimators Vf, F#, Ff, and F n obtained from simulated data with n = 20. 

It will be shown later that for xo > 0, 

*)' f <"'^f»i,„(,A (1.9) 







F*(x )- 


F(x ) 






F n {x ) - 


F(x ) 



N( 0,-^x g(x ) ) and (1.10) 



9 ' 

n(q,^xo9(x )] (1.11) 

under modest conditions. As above the isotonized estimators have exactly half limiting 
variances of corresponding naive estimators. 

Construction of confidence intervals for F(xq) using these limiting distributions is still 
complicated as they require the estimation of the nuisance parameter g(xo). Bootstrap 
intervals avoid this problem and are generally reliable and accurate in problems with 
y/n convergence rate (see Bickel and Frcedman [4], Singh [12], Shao and Tu [11] and its 
references). However, conventional bootstrap estimators are inconsistent for some shape 
restricted estimators - dramatically so for the Grenander estimator. See Kosorok [8], 
Abrevaya and Huang [1] and Sen et al. [10] and its references. So, it is not a priori clear 
whether bootstrap methods are consistent in the present context. We show that they are. 

In Section 2, we prove uniform versions of (1.7), (1.8), (1.9), (1.10) and (1.11). These are 
used in Section 3 to establish the consistency of bootstrap methods in approximating the 
sampling distribution of the various estimators ofV and F, while generating samples from 
the EDF. Using data on the globular cluster M62 we illustrate the isotonized estimators 
of F along with the corresponding bootstrap based point-wise CIs in Section 4. Section A, 
the Appendix, gives the details of some of the arguments in the proofs of the main results. 



2. Uniform convergence 

In this section, we prove central limit theorems for estimates of V and F when we have 
a triangular array of random variables whose row-distributions satisfy certain regularity 
conditions. This generalization will also help us analyze the asymptotic properties of the 
bootstrap estimators (to be introduced in Section 3). Note that conditional on the data, 
bootstrap samples can be embedded in a triangular array of random variables, with the 
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nth row being generated from a distribution (built from the first n data points) that 
approximates the data-generating mechanism. 

Suppose that we have i.i.d. triangular data {Y nt i}f =1 having distribution function G n . 
We consider a special construction of Y n j, namely, let Y n j = G^ 1 ^), where G~ 1 (m) = 
inf{x: G n {x) >u} and Ti,T2,... are i.i.d. Uniform(0, 1) random variables. Let V n and 
U n be defined as 

V n (y) = [°° and U n (x) = f V n {y) Ay. 

Jy V U - V JO 

Let LCM/ be the operator that maps a function h : M. — > R into the least concave majorant 
(LCM) of its restriction to the interval / C K. Define V n := LCM[ 0iOO ) [U n ]' where ' denotes 
the right derivative. Let Gjf denote the EDF of Y n> x,Y ni 2, ■ ■ ■ , Y n>n , 

Then is a nonmonotonic, unbiased estimate of V n (y), as above, and we call the 
naive estimator. The naive estimator can be improved by imposing the monotonicity 
constraint as in (1.5) to obtain V#. Observe that 

1=1 

is an unbiased estimate of U n (x) for all x € [0, oo); C/jf is a nondecreasing function; 
is the derivative of U# a.e. Let U# be the LCM of . Then V# is the right-derivative 
of /7,f (see, e.g., Lemma 2 of [6]). Let F n and F 7 f be defined by replacing V from the 
right-hand side of (1.3) with V n and V#, respectively. 

2.1. CLT for estimates of V 

Fix Xo £ (0, oo) such that g(xo) > 0. We consider two estimates of V(xo), namely V#(xq) 
and V^(xo). To find the limit distribution of V&(xo), we assume the following conditions 
on G„: 

V n (x )^V(x ), (2.1) 

n{G n (x + ^ ]-G„(xo)}^0 for all £■>(), (2.2) 

I \ e^nlogn J J 

^°+ c » dG n (y) 

, = o(e n ), (2.3) 

_l r _=^u w , (2.4, 

l ognJ Xo+Cn y-xo 
where c„ = l/(V n logn + V n (xo)) 2 and e n is defined in (1.6). 
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Proposition 2.1. If (2.1)-(24) hold then £- 1 {V r #(a;o) - V„(x )} => JV(0, g(x )). 

The proof of the proposition is given in the Appendix. Next, we study the limiting 
distribution of 

L^n ■ — ) 

where V n (xo) can be V n (xo) or V n (xo). Define the stochastic process 

Zn(*) = £n 2 {U*(x + e n t) - U*(x ) - V n (x )e n t} 

for t G I n '■= [— s^xqiOo) and note that A„ = LCM/ n [Z„]'(0), that is, A„ is the right- 
hand slope at of the LCM of the process Z„. We will study the limiting behavior of 
the process Z n and use continuous mapping arguments to derive the limiting distribu- 
tion of A„. We consider all stochastic processes as random elements in C(R), the space 
of continuous functions on R, equipped it with the Borcl er-ficld and the metric of uni- 
form convergence on compacta. To better understand the limiting behavior of Z„, we 
decompose Z„ into the sum of 

Z»,i(*) = £n 2 {(U* - U n )(x + e n t) - (U* - U n )(x Q )} and 
Z„,2(*) = £n 2 {U„{xo +e n t) - U n (x Q ) - V n (x Q )e n t}. 
Observe that Z„ 2 depends only on G n and not on the Y n j. Let 
Z 1 (t)=tW and Z(t)=Z 1 (t) + ±t 2 V'{x ) 

for tgM, where W is a normal random variable having mean and variance ^g(xo). 
We state some conditions on the behavior of G n , V n and U n used to obtain the limiting 
distribution of A n . 

(a) D n :— \\G n — G\\ = 0(e„), where || • || refers to the uniform norm, that is, \\G n — 
G||=su Pt . K |G n (i)-G(i)|. 

(b) Z n ,2(t) — > ht 2 V'(xo) as n—t oo uniformly on compacta. 

(c) For each e > 0, 

|^«(aro +0) ~ U n (x Q ) - pV n (xo) - \pV'(x )\ < e(3 2 + o(/3 2 ) + 0(4) 
for large n, uniformly in j3 varying over a neighborhood of zero. 

Theorem 2.1. Under condition (a) the distribution of Z nj i converges to that of 
Further, if (b) holds, then the distribution of Z n converges to that ofl. 

Proof. The covariance of Z Wi i(s) and Z ni i(£), is needed. To compute it let 

4>(y,rj) := y/{y-x )+ - y/ (y - x - v)+ 
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for y, r\ £ R and observe the following two properties: 

\<t>'(y,r])\dy<2^ 



(PI) 
(P2) 



of which the second follows from splitting the interval of integration into [0,xo + 77] and 
(xq + 77,00). Observe that 



Z ni i(t) = - ? / <j>{u,e n t)d{G*-G n ){u) 



and 



Cov(Z„ i i(s),Z„ i i(i)) = — j Cov(</>(y„,i,e n s),0(y n ,i,£ n i)), 



where Y n> \ ~ G n . Wc first show that E[cj)(Y n ,i,e n t)] = 0(e n ), so that Cov(Z ni i(s), 
Z n ,i(^)) = (l/n£fj£ , [^(y ri 4,e n s)0(F„ i i,e n f)] +o(l). For this, observe that 

E[<t>{Y n ^e n t)] = 2 J ' 4>(u,e n t)d{G n - G){u) + {U(x + te„) - U(x )}. (2.5) 

The first term is at most 

pOO p 00 

2/ {G n -G){u)4>'{u,e n b)&u<2D n \<j>'{u,e n t)\du 
Jo Jo 



= 20(e„)2v^M=0(ef/ 2 ), 

and the second term in (2.5) is at most 0(e„) by using a one term Taylor expansion. 
Next, suppose that s < t and write E[<p{Y n ^\, e n s)<j)(Y n .i , e n t)J as 



(u,£ n s)<f>(u,£ n t)d(G n - G)(u) + / (f)(u,E n s)<f>(u,£ n t)dG(u) 



From Lemma 3 of Groeneboom and Jongbloed [6], page 1539, 



1 



(u,e n s)(/)(u,£ n t)dG(u) = -^g(x Q )stel loge„ + 0(4). 



Using integration by parts, (PI), and (P2), the first term in (2.6) is at most 
{(j>'(u,£ n s)4>(u,e n t) +(f)(u,£ n s)(/)'(u,s n t)}(G n - G)(u)du 



(2.6) 



(2.7) 



< 2D n ^\e~t\{2^/\e^7\} = 0(e 2 n ). 
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So, 



ne 
1 



Cov(Z n , 1 (a) > Z n ,i(t)) = — j-^ -stg{x )ei\og[ — )+0(e 2 n ) 



(x )st< 1 - 



log log n 
log n 







logn 



(2. 



It follows directly from the Lindeberg-Feller central limit theorem for triangular ar- 
rays that Z n> i(l) =>■ N(0,^g(xo)); and Chebyshev's inequality implies that \sZ n ,i(t) — 
iZ„.i(s)| = op(l) asn->oo for all for all fixed s,t £ R. So, the finite dimensional distri- 
butions of Z ni i converges weakly to the finite dimensional distributions of Zi. 

For the the convergence in distribution of Z TCj i to Zi in C(R), it suffices to show that 
for each M > and sequence of positive numbers {S n } converging to zero, 



£{sup|Z n ,i(s)-Z n ,i(i)|: 



t\ < S„,mnx(\s\, \t\) < M} -t 0. 



See Theorem 2.3 of Kim and Pollard [7]. Consider the class of functions Cr = 
{</>(•, rj): \rj\ < R} with its natural envelope <&n(y) := yj(y — xo + R)+ — \J{y — xq — R)+- 
Observe that Cr are uniformly manageable for its envelope &r and that &r < \[2R. Let 
5 n be a sequence of positive numbers converging to zero, h(y;s,t) := <f>(y,t) — (j>{y,s) = 
v / (y-x -s) + - y/(y-x -t)+ for y,s,t£R, and H„ := {h(-; se n ,te n ): max(|s|, \t\) < 
M, \s — t\< S n }. The class H n has envelope H n := 2$Me n . Observe that 

Z n!l (t) = 2e- 2 (G* -G n )<f>(-,te n ). 

So, it suffices to show that £~ 2 £'[sup ft6Wn \(Gf — G n )h\] = o(l). Define S„ := G*i/^/(ne^) 
and T n := sup^ e ^ G#h 2 . Then by the maximal inequality of Section 3.1 in Kim and 
Pollard [7], there is a (single) continuous function J(-) for which J(0) = 0, J(l) < oo, and 



C M 



sup 

hen n 



G n h\ 



< 



-.E 



= E 



GtHl J (sup 
n£ n S n 



G*h 2 



Let 77 > 0. Splitting according to whether {SVi < ?7} or not, using the fact that ne^Sn > T" n 
and invoking the Cauchy-Schwarz inequality for the contribution from {S n > rj}, we may 
bound the last expected value by 



E 



SnHS n < 77} J 



ne^Sn 



E 



l{S n > n}J 



ne^Sn 



<y/vJ(l) + VEK\ EJ 2 min 1, 



ne*r] 
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Noting that $M Er , 
we have 



(•,Me„) -</>(•, -Me n ) and using (2.7) and (2.8) with - s = t = M, 



-E5V, 



net 



0(1). 



(2.9) 



So, it suffices to show that T n = op(nef l ), which implies E[J 2 (min(l,T n / (ne^-q))] 
(note that J(l) < oo). We will establish the stronger result ET n — o(ne*). Observe that 



E 



sup G#h* 



<E sup G n h 2 + E 

H„ 



sup \G* h 2 -G n h 2 \ 



and 

G n h 2 = G n [(f>(y,te n ) - cf>(y, se n )} 2 = -±g(x )(s - t) 2 e 2 n \oge n + 0(4) 
= 0(6lnet l )+0(e 2 n )=o(ne 4 n ) 

by (2.7). The maximal inequality applied to the uniformly manageable class {ft 2 : h £ T-L n } 
with envelope H 2 bounds the second term by J(l)y/G n H%/n < 8Ale n /\/n — o(ne^), 
where we have used (2.9) and the fact that H 2 < 8Ms n . That Z„ converges in distribution 
to Z in C(R) follows directly. □ 



A rigorous proof of the convergence of A„ involves a little more than an application 
of a continuous mapping theorem. The convergence Z n => Z is only in the sense of the 
metric of uniform convergence on compacta. A concave majorant near the origin might 
be determined by values of the process long way from the origin; the convergence Z„ => Z 
by itself does not imply the convergence LCM; n [Z„] LCMr[Z]. We need to show that 
LCM^JZn] is determined by values of Z„ for t in an Op(l) neighborhood of the origin. 
Corollary 2.1 shows the convergence of A„, and its proof is given in the Appendix. 

Corollary 2.1. Under conditions (a)-(c), the distribution of A„ converges to that of 
V7 = LCM R [Z]'(0). 



2.2. CLT for estimates of F 

We consider three estimates of F, namely F#, Ff and F#, where F# and F# are 
obtained by replacing V from the right-hand side of (1.3) with Vf and V^, respectively; 
and Ff is the closest (in the sense of minimizing (1.5) with V n replaced with F#) 
nondecrcasing function to Ff. We start by deriving the limit distribution of F# . Let 
cr 2 := Var[sin _1 y/lA{x Q /Y)] where Y ~ G. 

Proposition 2.2. If g(xo) > and \\G n — G\\ — s- as n^r oo, then 

Jxo 2 V" 
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If also (2.1)-(24) hold, then 

F#(x )~F n (x ) 



4 

N[ 0, — x g(x ) 



(2.11) 



Proof. For (2.10), observe that 

V*{u) J 1^ f°° l{Y nii >u} J 



9 -n ^— ' A/ V , 



2y/uy/Y nt i - u 2 n ^ 



after some simplification, and (similarly), 



V„(u) 7t 
=r dU — 

2y^ 



Relation (2.10) now follows from the Lindebcrg-Feller CLT. For (2.11), first observe that 
Ff{xo) — F n (xo) may be written as 



2 2 
— y/xo{V*(x ) - V n (x )} 

7T 7T 



V*{u)-V n {u) 

2y/u~ 



du. 



From Proposition 2.1, e~ x {V*{x ) ~ V n (x )} => N(0,g(x o )). Relation (2.11) follows di- 
rectly from this and (2.10). □ 

Applying the proposition with G n = G verifies (1.9). Next, we derive the limiting 
distribution of F# . 

Proposition 2.3. Suppose that (a)-(c) hold with V n = V n , then, 
F*(xo)-F n (x ) 



N[ 0, -^x a g(x a ) 



71- 



Proof. As above F#(xq) — F n (xo) may be written as 



-V^{Vn(x a )~V*(x )} + - 
7T 7T 



V n (u)-V#(u) 



du. 



From Corollary 2.1, e n 1 {V n {xo) — V#(xq)} => N(0,^g(xo)). Integrating by parts, the 
integral on the last display is a most 



\U n (x )-U#(x )[ 1 
4 



< 



\Un~U#\ 

2Jx^ 



U n (u)-U#(u) 



.3/2 



du 



+ \\U n -U*\\ 



2Jx^ 
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_\\u n -u*\\ < \\u n -u$\\ 



(Xq y/X 

= P (n- 1 / 2 )-op(e„) 

by Marshall's lemma and maximal inequality 3.1 of Kim and Pollard [7] (to bound 
\\U n — f/jf ||). The proposition follows. □ 

Now let H n (x) := f* F n (z) dz and H#(x) := F#(z) dz. Note that F# is the deriva- 
tive of H# a.e. Let be the greatest convex minorant (GCM) of H* . Then F n is the 
right-derivative of H#. We want to study the limit distribution of 

_ F n (x ) - F n (x ) 
£ n 

where F n can be F n or F n . Let 

X„(i) := e ? ; 2 {i7,f (.t + e n t) - H*(x ) - F n {x )e n t} 
for t G Z n := [— e~ xq, oo). As before, we decompose X„ into X n> i and X Wj 2 where 

X„,i(i) := e r ; 2 {(H ? f - H„)(x + e n t) - (H* - H n )(x )} and 

X„ )2 (i) := £„; 2 {i?„(a;o + e„i) - H n (x ) - F n (x Q )e n t}. 

Let GCMj be the operator that maps the restriction of a function h : M. — > K to the 
interval / into its GCM, and observe that A„ = GCM/ n [X„]'(0). Also let 

Xi(t)=tW and X(i) =X x (t) + \t 2 f{x G ) 

for t G R, where is a normal random variable having mean and variance 2xog(xo)/n 2 
and / is the density of Z = X 2 + X 2 + X 2 . The following conditions will be used. 

(b') X n> 2(i) — > 7}t 2 f(xo) as n— > oo uniformly on compacta. 
(c') For each e > 0, 

\H n (x + *3) - H n (x ) - PF n (x ) - \p 2 f(x )\ < efi 2 + o((3 2 ) + 0(e 2 n ) 

for large n, uniformly in ft varying over a neighborhood of zero. 

Theorem 2.2. Under condition (a), the distribution of X^i converges to that o/Xi. 
Further, if (b') holds, then the distribution o/X„ converges to that o/X. 



Proof. Using the definitions of F#, H n and we may write H#(x) — H n (x) as 



J z 
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and 



(V*-V n )(u) 



2Ju 



da 



< 



2^y - uy/u 



71 



d(G*-G n )(y) 



■d(G*-G n )(y) 



d 

dy 



(G*-G)(y)dy 



71 



-\\G* - G„|| =o(e„) a.s., 

using the Law of Iterated Logarithms for ||G* — G n \\ = o(e„) a.s. Fix a compact set 
K = [-M, M] . Then 



X„,i(i) 



2 /-a:o+£„i 
7T 



Vz[V*(z)-V n (z)] 



dz + o(l) 



J x 



71 



(2.12) 



-y/x^Zn.^t) +Op(l) 



71 



uniformly on K. Note that (2.12) follows as \ Q {yfz - y/xo)[V* (z) - V n {z)]/e 2 n dz\ 
can be bounded, using integration by parts, by 



rx +e n t dz 

\y/x + e n t - y/xE\ max |Z„,i(s)| + max \1 n ,i(s)\ / T7 ==o P (l), (2.13) 

J x 



\<M 



\s\<M 



2^ 



as max| s |< M |Z nj i(s)| = Op(l). The theorem now follows. 



□ 



Corollary 2.2. Under conditions (a), (b') and (c') ; the distribution of A n converges to 
that of W = GCM R [X]'(0). 

The proof is very similar to that Corollary 2.1 with the LCMs changed to GCMs. The 
modifications are outlined in the Appendix. 



3. Consistency of the bootstrap 

We begin with a brief discussion on the bootstrap. Suppose we have i.i.d. random vari- 
ables (vectors) T\ , T% , . . . , T n having an unknown distribution function A defined on a 
probability space (Q,A, P) and we seek to estimate the sampling distribution of the ran- 
dom variable R n (T n , A), based on the observed data T„ = (Ti, T 2 , . . . , T n ). Let [i n be the 
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distribution function of i?„(T„, A). The bootstrap methodology can be broken into three 
simple steps: 

Step 1: Construct an estimate A n of A based on the data (for example, the EDF). 

Step 2: With A„ fixed, draw a random sample of size n from A n , say T* = 
(Ti, T% , . . . , T*) (identically distributed and conditionally independent given T„). 
This is called the bootstrap sample. 

Step 3: Approximate the sampling distribution of i?„(T„,A) by the sampling distri- 
bution of i?* =i?„(T*,A„). The sampling distribution of i?*, the bootstrap distri- 
bution, can be simulated on the computer by drawing a large number of bootstrap 
samples and computing i?* for each sample. 

Thus the bootstrap estimator of the sampling distribution function of i? n (T„,A) is 
given by \x* n (x) = P*{R* l < x} where P*{-} is the conditional probability given the data 
T„. Let L denote the Levy metric or any other metric mctrizing weak convergence of 

distribution functions. We say that /i* is (weakly) consistent if L(/x„,/i*) — > 0. Similarly, 
[i* n is strongly consistent if L(/Lt n ,/Lt*) — > a.s. If \i n has a weak limit fi, for the bootstrap 
procedure to be consistent, /i* must converge weakly to /i, in probability. In addition, if 
\i is continuous, we must have 

sup (x) — fi(x)\ — >• as n^-oo. 



3.1. Bootstrapping V n 

Given data Y%,Y2, Y n ~ G let Gf denote its EDF. Suppose that we draw condition- 
ally independent and identically distributed random variables Y* 1 , Y* 2 , . . . , Y* n having 
distribution function G% ; and let G* be the EDF of the bootstrap sample. Letting 

V *(y) :=- Y , 1 = f^M dG * {u) and 
2 ™ / 



= 2 J {y/H-y/(u-x) + }dG* n (u), 

the isotonic estimate of V based on the bootstrap sample is V* = LCM[ 0iOO ) [U*]' . The 
bootstrap estimator of the distribution function of A„ = £^ 1 {V^ l (a- ) — V(x )} is then 
the conditional distribution function of A* := e~ 1 {l^'(xo) — V n (xo)} given the sample 
Y u . . . , Y n . To find its limit let 



Kit) = e- 2 {U* n (x Q + e n t) - U*(x ) - V n (x )e n t} 
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for t 6 I n := [— e n xq,oo) and decompose Z* into Z* j and Z* 2 where 

= e n 2 {iK - U*){x +e n t) - (U* - U#)(x )}, 
K,S) = e- 2 {U*(x +e n t) - U*(x Q ) - V*(x Q )s n t}. 

Recall that Zi(i) = and Z(t) = Zi(i) + \t 2 V {x ) are two processes defined for t £ R, 
where W is a normal random variable having mean and variance ^g(xo). Let Y = 
(Yi, Y2, . • •)■ The following theorem shows that bootstrapping from the EDF G* is weakly 
consistent. 

Theorem 3.1. Suppose that V is continuously differ entiable around Xq, and g(xo) =/= 0. 
Then: 

(i) The conditional distribution of the process Z* 1 , given Y, converges to that 0/Z1 

ffl.S. 

(ii) Unconditionally, Z* 2 (t) converges in probability to ^t 2 V'(xo), uniformly on com- 
pacta. 

(hi) The conditional distribution of the process Z* 7 given Y, converges to that of 1, 
in probability. 

(iv) The bootstrap procedure is weakly consistent, that is, the conditional distribution 
of A* , given Y ; converges to that of W , in probability. 

Proof. Assertion (i) follows directly from Theorem 2.1, applied with G n = Gjf , Gjf = G* 
and P{-} = P*{-} = ^{'lY}, since condition (a) required for Theorem 2.1 holds a.s. For 
(ii) and (iii), let 

=£n 2 {U* (x +te n ) - U*{x ) - e n W{x )} 

for t € I n . By Theorem 2.1, applied with G n — G, V n = V and U n — U for all n, Z^ 
converges in distribution to Z. To prove (ii) observe that 

Z;, 2 (i)=Z°(t)-t-LCM /n [Z0]'(0). 

Unconditionally, using the continuous mapping theorem along with a localization argu- 
ment as in Corollary 2.1, we obtain Z* n 2 (f) => Z(i) - t ■ LCM K [Z]'(0) = \t 2 V (x ). As the 

limiting process is a constant, Z* 2 (i) A ^t 2 V'(xo). Let be a subsequence of N. We 
will show that there exists a further subsequence such that conditional on Y, Z„ Z a.s. 
along the subsequence. Now, given {n^}, there exists a further subsequence {n^} such 
that Z* fc 2 (f) — y \t 2 V (xo) uniformly on compacta a.s. Thus, the conditional distribution 

of Z* fc given Y, converges to that of Z, for a.e. Y. This completes the proof of (iii). 

For (iv), we use Corollary 2.1. Although conditions (a) and (b) hold in probability, 
condition (c) holds with V n = V n and the O(e^) term replaced by Op(e 2 l ). Thus we 
cannot appeal directly to Corollary 2.1. Let £ > and 77 > be given. We will show 
that there exists N & N such that for all n > N, P{L(K,K*) >£}<??, where L is the 
Levy metric (Gnedenko and Kolmogorov [5], page 33), K is the distribution function of 
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W ~ N(0, ^g(xo)) and K* is the distribution function of A*, conditional on the data. 
For e > 0, sufficiently small, let 

An := {\U*(x +(3) - U#(x )-pV*(x ) - \p 2 V'{x )\ < Cel+e(3 2 }, 

where C > is chosen such that P{A n } < |. This can be done since (c) holds with 0(e n ) 
term replaced by P (e*). Further, let P°(E)(u) = P*(E)(u), if w G A„ and P°(E)(lj) = 
P{E), if w ^ A„; and let -KT° be the distribution function of A* under the probability 

pO 

measure P°. Observe that if„ = if* on A„ and that L(K, K °) -4 by Corollary 2.1 can 
be applied. Therefore, for all sufficiently large n, 

P{L(K,K*)>0 < pU(K,K° n ) > ^+P^L(K° n ,K*) > | 

<| + p|i(^,^)>| ) ^|<r ? . 

This completes the proof of (iv). □ 

Remark 3.1. Let J n (t) = f™ y/z~tdV#{z), for t > 0, as in (1.4). Then G# = 1 + 
2Jn(t)/n after some simplification. So, using (1.4) to generate the bootstrap sample 
would lead back to Gf . 



3.2. Bootstrapping F n , F n and F n 

Bootstrap versions of the three estimators of F under study, F*, F* and F* say, are 
defined as in Section 2.2; for example, F*(x) = 1 + (2/n) y/zdV*(z). We approxi- 
mate the sampling distribution of e~ 1 {Ff (xq) — F(xq)} by the bootstrap distribution of 
e~ 1 {F*(xo) — F#(xq)}- The bootstrap samples are generated from the EDF of the 
Yi's. By appealing to Proposition 2.2 with G n = Gf, it is easy to see that the bootstrap 
method is weakly consistent as (2.1)-(2.4) hold in probability. 

The sampling distribution of e~ 1 {F^ z (xo) — F(xq)} is approximated by that of 
e^ 1 {F*(x$) — F n (xo)}. Using Proposition 2.3, we can establish the consistency of the 
method. Note that the proof of Theorem 3.1 shows how conditions (a)-(c) are satisfied 
with G n = Gf , V n = V n required to apply Proposition 2.3. 

Recall that F* is the nondecreasing function closest to F*. Let H#(x) := f F#(z) dz 
and H*(x) := L F*(z)dz. Next, we show that approximating the distribution of A„ = 
e~ 1 {F n {xo) — F(xq)} by the bootstrap distribution of A* := £~ 1 {F*(xq) — F n (xo)} is 
consistent. To find the limit of the conditional distribution of A* , let 

X* n (t) = e- 2 {H*(x + e n t) - H*(x ) - F n (x )e n t} 

for t G /„ := [— e~ Xo, oo) and decompose it into X* 1 and X* 2 , where 

K,iV) = tn 2 {(K - H#)(x +e n t) - (H* - H#)(x )}, 
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KAt) = e- 2 {H*(x +e n t) - H#(x ) - F n (x )e n t}. 

Recall that Xi(i) = tW and X(t) = Xi(t) + ^t 2 f(xo) are two processes defined for t6l, 
where W is a normal random variable having mean and variance xoS^o ). 

Theorem 3.2. Suppose that F is continuously differentiable around xq, and gixo) 7^ 0. 
TTien; 

(i) The conditional distribution of the process X* 1; given Y, converges to that 0/X1 
a.s. 

(ii) Unconditionally, X* 2 (t) converges in probability to ^t 2 f(xo), uniformly on com- 
pacta. 

(hi) TTie conditional distribution of the process X*, given Y, converges to that 0/ X, 
m probability. 

(iv) TTie bootstrap procedure is weakly consistent, that is, the conditional distribution 
of A* , given Y, converges to that of W , in probability. 

Proof. The proof is very similar to that of Theorem 3.1. To hnd the conditional dis- 
tribution of X* 1 given Y, we appeal to Theorem 2.2 with G n =G*, G* = G* and 
P{-} = P*{-} = P{-\Y}. Note that condition (a) required for Theorem 2.2 holds a.s. We 
express X* i2 (t) as X° (t) - t • GCM/„ [X°]'(0) where 

X°(t) = s- 2 {H*(x +ts n ) - H#(x ) - F(x )e n t}. 

Note that unconditionally X° converges in distribution to X by an application of Theo- 
rem 2.2 with G n — G, F n = F and H n = H for all n. 

Unconditionally using the continuous mapping theorem along with a localization ar- 
gument as in Corollary 2.1, we obtain X* 2 {t) => X(t) - t ■ GCM R [X]'(0) = \t 2 f(x ). As 

the limiting process is a constant, X* 2 {t) ^ \t 2 f{xo). 

An argument using subsequences as in the proof of (iii) of Theorem 3.1 shows that 
the conditional distribution of the process X* , given Y, converges to that of X, in prob- 
ability. The last part of the theorem follows along similar lines as in the proof of (iv) of 
Theorem 3.1. □ 



4. Data application 

A globular cluster (GC) is a spherical collection of stars that orbits a galactic core as a 
satellite. GCs are very tightly bound by gravity, which gives them their spherical shapes 
and relatively high stellar densities toward their centers. The study of the inner Galactic 
GCs is important for several reasons - to understand the morphology of the inner Galaxy, 
to better constraint the characteristics of the Galactic bulge, etc. Data is available on 
individual stars in 25 globular clusters located toward the center of the Milky Way (see, 
e.g., Alonso [2] and Alonso et al. [3]). The left panel of Figure 2 shows the projected 
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Projected positions of the stars Estimates of F 




Figure 2. Left panel: Scatter plots of the projected positions of the stars; right panel: (red), 
Ff (black, piece- wise constant). 



positions of n = 2000 stars in the inner core of the globular cluster M62 (also known as 
NGC 6266). Interest focuses on estimating the distribution function F of the squared 
radius. The naive estimator of F, F#, is shown in the right panel of Figure 2 along 
with the isotonized estimator Ff. The two isotonic estimators F# and F n are virtually 
indistinguishable, and the left panel of Figure 3 shows the difference between the two 
estimators. Note that both the isotonic estimators have the same pointwise normal limit 
distribution. The right panel of Figure 3 shows the point-wise bootstrap based 95% CIs 
for F using the estimator Ff. A very similar plot is obtained using the estimator F n . 




5 5 10 15 20 25 30 35 40 5 10 15 20 25 30 35 40 

z z 

Figure 3. Left panel: F„ — F n ; right panel: Bootstrap based 95% pointwise CIs around F„ . 
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